Problem statement: there are a lot of powerful LLM backends with terrible UIs, and a lot of nice LLM front ends with bad or unscalable backends.

This PR allows the use of custom models from a private server hosting an OpenAI-compatible API.
Requirements:

A backend with an OpenAI-compatible API. Here are some popular ones:

- FastChat: an open platform for training, serving, and evaluating chatbots based on large language models. Includes scalability features.
- vLLM: a faster, more scalable option for serving LLMs.
- LMDeploy: less popular, but even faster and more scalable than vLLM.
- llama-cpp-python: a Python library with GPU acceleration, LangChain support, and an OpenAI-compatible API server.
- text-generation-webui: the most popular web UI. Supports NVIDIA CUDA GPU acceleration.
- LM Studio: a fully featured local GUI with GPU acceleration on both Windows (NVIDIA and AMD) and macOS.
- ctransformers: a Python library with GPU acceleration, LangChain support, and an OpenAI-compatible API server.
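Because all of these backends expose the same `/v1/chat/completions` schema, switching between them only changes the base URL the client points at. Here is a minimal sketch of building such a request with the standard library; the base URL (vLLM's default local address) and model name are illustrative assumptions, not values from this PR:

```python
import json
import urllib.request

# Assumed values for illustration: vLLM's default local address and a
# placeholder model name. Many self-hosted backends accept any API key.
BASE_URL = "http://localhost:8000/v1"
API_KEY = "EMPTY"

def build_chat_request(messages, model="my-model"):
    """Build a POST request for an OpenAI-compatible /chat/completions endpoint."""
    payload = {"model": model, "messages": messages}
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {API_KEY}",
        },
        method="POST",
    )

req = build_chat_request([{"role": "user", "content": "Hello!"}])
print(req.full_url)                     # http://localhost:8000/v1/chat/completions
print(req.get_header("Authorization"))  # Bearer EMPTY
```

Sending the request with `urllib.request.urlopen(req)` (or any HTTP client) works identically against any of the servers listed above.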
This will make libraries like [llama-gpt](https://github.com/getumbrel/llama-gpt) obsolete, since that project's focus is a nice UI, but it has an unscalable backend.
Here is how to test it:

Set up one of the backends above; I recommend vLLM due to its powerful serving backend. To skip the setup steps, test using my API server instead. I'll host it for a week or until this PR is closed:
- Endpoint: https://major-collie-officially.ngrok-free.app/v1/chat/completions
- API key: `EMPTY`
1. Enter your credentials into the app.
2. Choose a model supported by your API.
3. Start chatting on a local, private server!
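To find out which models your API supports, OpenAI-compatible servers also expose a `GET /v1/models` endpoint. A minimal sketch of parsing its response; the model name in the sample is illustrative, not a model this PR ships:

```python
import json

def list_model_ids(models_response: str) -> list:
    """Extract model IDs from an OpenAI-compatible GET /v1/models response body."""
    body = json.loads(models_response)
    return [m["id"] for m in body.get("data", [])]

# Example of the response shape returned by OpenAI-compatible servers;
# "vicuna-7b" is a placeholder model name.
sample = '{"object": "list", "data": [{"id": "vicuna-7b", "object": "model"}]}'
print(list_model_ids(sample))  # ['vicuna-7b']
```

Any ID returned here should be a valid value for the model field in the app.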